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(54) Method for molecular indexing categorising of expressed genes using restriction enzymes 



(57) This invention relates to a method for classify- 
ing (indexing) cDNA which has been reverse-tran- 
scribed from tissue- or cell-derived RNA, or DNA in a 
short period without duplication by using class-ilS 
restriction enzymes or a combination of a class-llS and 
a class-ll restriction enzymes. According to this inven- 
tion, it is possible to analyse and diagnose variations 
such as tumors easily, correctly and promptly by com- 
paring the analyzed pattern of genes expressed in a eel! 
or tissue sample with the analyzed pattern of norma! 
genes. This method is also applicable to the search and 
isolation of genes of physiologically active substances 
that are potential pharmaceuticals or causative genes of 
hereditary diseases, as well as the isolation of those 
genes that are useful for improving agricultural prod- 
ucts. 
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Description 

FIELD OF THE INVENTION 

This invention relates to a method for molecular indexing which is applicable to the analysis and diagnosis of dis- 
eases such as cancers, the search and isolation of genes of physiologically active substances that are potential phar- 
maceuticals or causative genes of hereditary diseases, as well as the isolation of those genes that are useful for 
improving agricultural products. 

BACKGROUND OF THE INVENTION 

For examining differences in gene expression between two tissues, there has been described a method wherein a 
portion (about 50-200 genes) of the expressed gene population is amplified by PGR (the polymerase chain reaction 
method) using any short primers and then separated by polyacrylamide gel electrophoresis [P. Liang and A. B. Pardee, 
Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction., Science 257: 967-971 
(1992)], However, in such differential display by means of PGR, only a portion of the whole gene population is amplified 
in principle and yet a plurality of bands are generated from the same gene. Furthermore, such display involves a large 
quantity of artifacts and thus is technically incomplete. Therefore, such display only shows differences in gene expres- 
sion between two tissues which are not remote from each other or differences In gene expression in cells. Such differ- 
ential display has a problem that it cannot record the expression of individual genes. 

It is also possible to analyze vahations in tissues or cells by determining the level of a particular gene in such tis- 
sues or cells through measuring the amount of its mRNA by Northern blot hybridization method. However, this method 
is not applicable when the target gene is not cloned or the base sequence thereof is unknown. In addition, this method 
IS not suitable for the analysis of a large number of genes. For example, since genes being expressed in a certain cell 
are considered about 10,000 species, it will take for about two years even if Northern blot hybridization is performed for 
100 genes per week. Thus, this method is not practically useful. 

On the other hand, those restriction enzymes which belong to class IIS (hereinafter, referred to as "class-IIS restric- 
tion enzymes") are restriction enzymes having an ability to cut at a precise distance outside their recognition sites. 
Those fragments cut by a class-!IS restriction enzyme are characterized to have non-identical, cohesive ends consist- 
ing of several nucleotides. There have been known more than 30 class-IIS restriction enzymes including Fok I, Bsm Fl, 
Bsm Al, Bbv I, Sfa Nl and Hga L It is estimated that genes which have at least one cleavage site of Fok I, Bsm Fl or 
Bsm Al will be97% of total genes. Brenner et al. has introduced a method of preparing a more detailed genome map 
using a class-IIS restriction enzyme which generates 4-nt (nucleotide) sequences in place of conventional restriction 
enzymes [S. Brenner and K. J. Livak, DNA fingerprinting by sampled sequencing, Proc. Natl. Acad. Sci. U.S.A.. 
86:8902-6 (1989)]. There is also disclosed a method wherein a part of restriction enzyme fragments derived from a 
phage or cosmid is amplified by using those adaptors which are Complementary to all possible 4-nt cohesive ends gen- 
erated by class-IIS restriction enzymes [D. R. Smith, ligation-mediated PGR or restriction fragments from large DNA 
molecules, PGR Methods Appl. 2:21-27 (1992): Unrau. R and Deugau, K.V, Gene, 145, 163-169 (1994)]. However, 
though all of these methods employ class-IIS restriction enzymes and use the 4-nt overhangs generated by them as 
means for structural analysis of genomes, unlike the present invention, they do not aim at recording the expression of 
genes in a specific tissue or cell. 

In the Human Genome Project, there is vigorously argued an approach to take a tissue-derived cDNA fragment as 
a sample and to determine a partial sequence thereof as well as its location in a chromosome. In conventional methods, 
cDNA fragments are randomly taken from cDNA library. Accordingly, it is impossible to avoid a repeated sampling of the 
same fragment and there is a tendency that highly expressed fragments are selectively taken. 

ft is an object of the present invention to provide a method which can analyze the state of expression of genes or 
deletion due to some abnormalities in a tissue or a cell in a short period and yet easily for a large quantity of genes. 

It is a further object of the present invention to provide a method which is applicable to a rapid isolation of the coding 
region of a protein as well as an amplification of restriction fragments of cloned DNA or genomic DMA, 

SUMMARY OF THE INVENTION 

The present inventor has made extensive and intensive researches toward the solution of the above assignment 
and, as a result, found that, by using class-ilS restriction enzymes or a combination of a class-IIS restriction enzyme 
and a class-ll restriction enzyme, it is possible to classify (index) cDNA or DNA into groups in a short period and without 
duplication. Thus, the present invention has been achieved. 

The present invention relates to a method for molecular indexing comprising the following steps (hereinafter 
referred to as "Method t"); 
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(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA with a first restriction 
enzyme of class-ilS. 

(2) Itgating each of the resultant cDNA fragments to one from a pool of 64 biotinylated adaptors cohesive to all pos- 
sible overhangs, 

(3) digesting the resultant cDNA fragments further with a second and a third restriction enzymes of class-IIS which 
are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA sample. 

(4) obtaining a second cDNA sample by repeating the above steps (1) to (3) wherein the second ctass-llS restric- 
tion enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are used for the 
subsequent digestion, 

(5) obtaining a third cDNA sample by repeating the above steps (1) to (3) wherein the third class-IIS restriction 
enzyme is used for the initial digestion and the first and the second class-IIS restriction enzymes are used for the 
subsequent digestion, 

(6) recovering each of the resultant ligation samples by using streptavidin-coated paramagnetic beads and then 
removing from the samples the oligonucleotide complementary to an adaptor-primer to be used in (7), 

(7) amplifying each of the resultant cDNA samples by PGR using an adaptor-primer and one of anchored oligo-dT 
primers, 

(8) separating the amplified products by denaturing polyacrytamide gel electrophoresis and recording the sizes of 
the fragments obtained. 

The present invention also relates to a method for molecular indexing comprising the following steps (hereinafter 
referred to as "Method 11"): 

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA, or DNA with a restriction 
enzyme of class-ll, 

(2) ligating each of the resultant cDNA or DNA fragments to an adaptor cohesive to ends generated by the class-ll 
restriction enzyme, 

(3) digesting the resultant cDNA or DNA fragments further with a restriction enzyme of class- ItS. 

(4) tigating each of the resultant cDNA or DNA fragments to one from a pool of 64 biotinylated adaptors cohesive 
to ail possible overhangs, 

(5) recovering the resultant ligated sample by using streptavidtn-coated paramagnetic beads and then removing 
from said sample the oligonucleotides complementary to adaptor-primers to be used in (6), 

(6) amplifying the resultant cDNA or DNA sample by PGR using adaptor-primers, 

(7) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes of 
the fragments obtained. 

BRIEF DESCRIPTION OF THE DRAwInGS 

Fig. 1 shows a schematic illustration for the principle of Method I. 

Fig. 2 shows structures of cDNA which has been synthesized by reverse-transcribing RNA using a mixture of 3 oli- 
gonucleotides as primers. 

Fig. 3 shows a schematic illustration for the principle of Method 11. 

Fig. 4 shows an example of the polyacrytamide electrophoresis pattern of mouse liver RNA obtained by Method i. 
Fig. 5 shows an example of the polyacrylamide electrophoresis pattern of an amplified product from mouse liver 
RNA obtained by Method II. 

EFFECT OF THE INVENTION 

According to Method I, it is possible to examine the state of expression of those genes having cleavage sites of 
class-liS restriction enzymes (97% of total genes are estimated to have such sites when Fok I, Bsm Al and Bsm Fl are 
used) in a tissue with one to two week experiment per one human subject, since a small number of DNA sub-groups 
will do for this analysis. Furthermore, according to Method I. since the number of fragments amplified from one gene is 
only one in principle, genes can be classified (indexed) into sub-groups without redundancy. Therefore, by comparing 
the analyzed pattern between normal and abnormal tissues by using Method I, it Is possible to diagnose variations such 
as tumors easily, correctly and promptly In addition, Method ! is also applicable to the search and isolation of genes of 
physiologically active substances that are potential pharmaceuticals, or, the causative genes of hereditary diseases, as 
well as the isolation of those genes that are useful for improving agricuitura! products. 

On the other hand, according to Method 11, the target of analysis is not limited to RNA (or cDNA reverse-transcribed 
therefrom), since oligo-dT primers for poly A are not used as primers. According to Method il, it is also possible to 
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amplify restriction fragments of cosmid DNA or genomic DNA. Therefore. Method II is applicable to the mapping of 
these DMAs. 

in addition, regions amplified by PGR is not restricted to non-coding regions and thus it is not necessary to obtain 
clones of upstream regions in order to know the primary structure of a protein. 

DETAILED DESCRIPTION OF THE INVENTION 

[I] Hereinbelow, the steps, action and effects of Method I will be described with reference to Fig. 1 . 

(1) First, the total RNA of a ceil or a tissue is converted to cDNA with a reverse-transcriptase and the resultant 
cDN A is digested with a first class-lfS restriction enzyme. 

(2) One from a pool of 64 biotinytated adaptors described below is ligated to the resultant cDNA fragments with E. 
coii DNA ligase. Each adaptor has a 4-nt 5' end overhang wherein the outermost base is a mixture of A, C, G and 
T and the inner three bases are one of all possible sequences, (These adaptors must not be phosphorylated at their 
5' ends which form protruding cohesive ends.) At this point, the restriction fragments are classified into 64 sub- 
groups. 

(3) Subsequently, the cDNA fragments are further digested with a second and a third class-ilS restriction enzymes 
which are different from the first class-tIS restriction enzyme used in (1) above to thereby obtain a first cDNA sam- 
ple. 

A second cDNA sample is obtained by repeating the above steps (1) to (3) wherein the second class-l IS 
restriction enzyme is used for the initial digestion and the first and the third ctass-llS restriction enzymes are used 
for the subsequent digestion, and also a third cDNA sample is obtained by repeating the above steps (1) to (3) 
wherein the third class-IIS restriction enzyme is used for the initial digestion and the first and the second class-IIS 
restriction enzymes are used for the subsequent digestion. 

(4) As a result of digestion with the 3 class-IIS restriction enzymes described above, there are produced fragments 
which have lost poly A [see Fig. 1 . (i)] and fragments which still have poly A [see Fig. 1. (!i)]. Of these fragments, 
the former ones which have lost poly A \A^tll no longer be amplified in the subsequent amplification step and only the 
latter ones with poly A will be amplified. Accordingly the latter fragments are further classified into 64 x 3 = 192 sub- 
groups at this point depending on the cleavage site nearest the poly A side (i.e., depending on the cleavage site of 
which of the three restriction enzymes used). 

(5) Subsequently the ligation sample is recovered with streptavidin-coated paramagnetic beads and the cDNA 
fragments are treated with a dilute alkaline solution. By these operations, the oligonucleotide complementary to an 
adaptor-primer to be used in (6) is removed (the oligonucleotide will become an inhibitor against PGR reaction). 

(6) The resultant cDNA sample is amplified by PGR by using a combination of an adaptor-primer and one of 
cl(T)25A, d(T)25C and d(T)25G which are anchored oligo-dT primers. Depending on the base (T C or G) adjacent to 
the poly(A) tail, fragments amplified by the above three oligo-dT primers are determined. At this point, the cONA 
fragments are further classified into 192 X 3 = 576 groups. 

(7) The amplified products are separated by denaturing polyacrylamide gel electrophoresis and the sizes of the 
fragments obtained are automatically recorded by a sequencer. 

The above-described procedures are repeated with 64 adaptors, 3 class-IIS restriction enzymes and 3 anchored 
oligo-dT primers. Therefore, an RNA population is classified into 576 groups. With respect to class-IIS restriction 
enzymes, it is estimated that 97% of genes have at least one cleavage site of Fok I. Bsm Al or Bsm Fl. Accordingly, by 
using these 3 restriction enzymes in the method of the invention, it is theoretically possible to recover and present with- 
out redundancy almost all of one total RNA population. 

In addition, the above method (Method t) of the invention may be similarly carried out in a modified method which 
is different from the above only in the following points. In step (2) above, one from a pool of 256 biotinylated adaptors is 
used. Each adaptor of the pool has a four-nucieotide 5' end overhang wherein the sequence is one of all possible 
sequences. The second digestion with class-IIS restriction enzymes described in (3) above is not carried out. 

In this modified method, an RNA population is classified into 768 groups since 256 adaptors and 3 anchored oligo- 
dT primers are used. 

Further, in Method !, a mixture of the following oligonucleotides may be used as primers when converting the total 
RNA from a cell or tissue into cDNA with a reverse transcriptase: 
5'0H-GGATGCTi6A-3' 
5*OH-CAGCTGTi6C-3' 
5'0H-CTCGAGTi6G-3' 

When such primers are used, there can be obtained cDNA molecules which have T G or G adjacent to poly (A) on 
the 5' side and a 6-base sequence added to the outside (3' side) of poly (A) (see Fig. 2). 
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in this case, amplification is carried out by using any one of 5' OH-GGATCCT^f^A-S' [instead of the above anchored 
oligo-dT primer d(T)o5A]. 5 OH-GAGGTGT,5G-3' [instead of the above d(T)25C] and 5* OH-CTCGAGTVgG-3' [instead 
of the above d(T)25G]. According to these procedures, analysts can be more correct because, in addition to the specif- 
icity to cDNA of only one base of the 3' end of pnmers, specificity to cDNA by the 6-base sequence of the 5' end of prim- 
ers is utilized. 

The target RNA for Method I of the invention is isolated and purified from, for example, body tissues such as hemat- 
opoietic tissues including bone marrow, peripheral blood, lymphocytes, etc. or cells in a body fluid by conventional 
methods such as the guanidine thiocyanate method and the phenol-chloroform extraction method and then incubated 
with a reverse transcriptase and deoxynbonucleotide triphosphates for reverse-transcription into cDNA. 

With respect to the class-IIS restriction enzymes used in Method I of the invention, there is no particular limitation 
as long as the restriction enzyme forms a 5'-protruding cohesive end consisting of 4 bases. Specific examples include 
commercially available Fok i (Takara Shuzo) and Bsm Al and Bsm Fl (both manufactured by NEB). These three restric- 
tion enzymes may be used in combination for the initial digestion (with one enzyme) and the subsequent digestion (with 
two enzymes). In the modified method, one of these three enzymes may be used. 

\n Method I of the invention, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24-27 
nucleotides which forms a 4-nt 5' protruding cohesive end wherein the outermost base is a mixture of A. C. G and T, 
and inner three bases are one of all possible sequences, and it) an oligonucleotide which is-complementary to the oli- 
gonucleotide i). shorter by 4 bases and biotinylated at the 5' end. Thus, there are 64 kinds of the biotinylated adaptors. 

In the modified method, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24-27 
nucleotides which forms a 4-nt 5' protruding cohesive end wherein the sequence is one of all possible sequences, and 
ii) an oligonucleotide which is complementary to the oligonucleotide i). shorter by 4 bases and biotinylated at the 5' end. 
Thus, there are 256 kinds of the biotinylated adaptors. 

In order to allow E. coli DNA ligase to recognize the 3 bases of a cDNA fragment adjacent to the binding site, phos- 
phorylation of the 5' ends of the above adaptors which form cohesive ends is not carried out. 

In Method I of the invention, one of the two primers used for PGR is an oligonucleotide having a common sequence 
with the oligonucleotide constituting the adaptor described above which is subjected to ligation to cDNA at 3' end (= 
adaptor-primer). As a marker which labels this adaptor-primer, those which are used in conventional analysis may be 
used. Specific examples include fluorescent dyes, radioactive materials and enzymes. 

In Method 1 of the invention, another primer used for PGR is one of three oligo-dT primers, of which 3' end base is 
A. G or G. These pnmers may be synthesized by a commercial nucleic acid synthesizer. 

[II] Hereinbelow. the steps, action and effects of Method II will be described with reference to Fig. 3. 

(1) First, DNA or cDNA of a cell or tissue is digested with a class-ll restriction enzyme (EcoRI is used in Fig. 3). 

(2) An adaptor which is cohesive to ends generated by the class-ll enzyme isijigated to each of the DNA or cDNA 
fragments with T4 DNA ligase (the adaptor must be phosphorylated at the 5' Ind which form cohesive ends). 

(3) The resultant DNA or cDNA sample is further digested with a class-tIS restriction enzyme (Bsm Al is used in 
Fig. 3). 

(4) One from a pool of 64 biotinylated adaptors described below is li gated to each of the resultant cDNA or DNA 
fragments with E. coii DNA ligase. Each adaptor has a 4-nt 5' end overhang wherein the outermost base is a mix- 
ture of A, C. G and T, and the inner three bases are one of ail possible sequences. (These adaptors must not be 
phosphorylated at their 5' ends which form cohesive ends.) At this point, the restriction fragments are classified into 
64 groups. 

(5) Subsequently, the ligation sample is recovered with streptavidin-coated paramagnetic beads and the DNA or 
cDNA fragments are treated with a dilute alkaline solution. By these operations, those oligonucleotides comple- 
mentary to adaptor-primers which will become inhibitors against PGR reaction are removed. 

(6) Amplification by PGR is carried out using two adaptor-primers. The one derived from the adaptor for ends gen- 
erated by the class-ll enzyme is referred to as "adaptor-primer 1 " and the other derived from the biotinylated adap- 
tors is referred to as "adaptor-primer 2". Details will be described aftenwards. 

(7) The amplified products are separated by denaturing polyacrylamide gel electrophoresis and the sizes of the 
fragments obtained are automatically recorded by a sequencer. 

By using a class-ll restriction enzyme, a ciass-tIS restriction enzyme and 64 biotinylated adaptors in the operations 
described above, the DNA or cDNA fragments generated by the class-ll and class-IIS restriction enzymes used can be 
separated and displayed. 

When cDNA which has been reverse-transcribed from RNA is used as a target of analysis of Method II. a cDNA 
sample is prepared as follows. RNA is isolated and purified from, for example, body tissues such as hematopoietic tis- 
sues including bone marrow, peripheral blood, lymphocytes, etc. or cells in a body fluid by conventional methods such 
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as the guanidine thiocyanate method and the phenol -chloroform extraction method and then incubated with a reverse 
transcriptase and deoxyribonucteotide triphosphates for reverse-transcription into cDNA. 

It is also possible to use DNA as a target of analysis of Method II. In this case, a DNA sample is prepared as follows, 
DNA isolated from, for example, body tissues such as hematopoietic tissues including bone marrow, peripheral blood, 
lymphocytes, etc. or a cell suspension in a body fluid is crushed with polytron or the like and incubated with proteinase 
K to thereby degrade proteins. Then, the reaction solution is subjected to phenol extraction and 2 volumes of ethanol is 
added to the aqueous layer for precipitation. The precipitate is treated with ribonuclease (RNase) not containing deox- 
yribonuclease (DNase) to thereby remove RNA. 

With respect to the class-li restriction enzyme used in Method II of the invention, there is no particular limitation as 
long as the enzyme recognizes a specific base sequence, cut the site specifically and generate cohesive ends. Specific 
examples include EcoRI, BamHI. Hindlll. Belli, Bglll, Sail, Xhol. AccI, Aval, SauSA, TaqI, Not! (which form 5'-protruding 
cohesive ends), and PstI, Sad, Kpnl, Haell (which form 3'-protruding ends), 

In particular, for the analysis of genomic DNA, restriction enzymes which recognize a 8-base sequence (e.g.. NotI) 
are preferably used. 

With respect to the ctass-llS restriction enzyme used in Method II of the invention, there is no particular limitation 
as long as the enzyme generates 4-base 5'-protruding cohesive ends. Specific examples include commercially availa- 
ble Fok I (Takara Shuzo) and Bsm Al, Bsm Fl, SfaNI and Bbvl (all manufactured by NEB). 

It is also possible to use 2 or 3 ciass-IIS restriction enzymes in combination to increase the number of groups as 
described in Method 1. 

In Method 11 of the invention, the adaptor consists of i) an oligonucleotide of 20-30 nucleotides forming a 5'- (or 3'- 
) overhang which is cohesive to ends of restriction fragments, and li) an oligonucleotide which is complementary to the 
above oligonucleotide i) and shorter by the number of bases forming the overhang. 

The adaptor must be phosphorylated at its 5' end (which form a cohesive end) so that an adaptor oligonucleotide 
is bound to the DNA strand which is recovered with streptavidin -coated beads. 

In Method II of the invention, the btotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24- 
27 nucleotides which forms a 4-nt 5' protruding cohesive end wherein the outermost base is a mixture of A, C, G and T 
and inner three bases are one of all possible sequences, and ii) an oligonucleotide which is complementary to the oli- 
gonucleotide i), shorter by 4 bases and btotinylated at the 5' end. Thus, there are 64 kinds of the biotinylated adaptors. 

tn order to allow E. coli DNA Itgase to recognize the 3 bases of a cDNA fragment adjacent to the binding site, phos- 
phorylation of the 5' end of the above biotinylated adaptor which form a cohesive end is not carried out. 

In Method II. one of the primers used in PGR is an oligonucleotide having a common sequence with the oligonu- 
cleotide constituting the adaptor described above which is subjected to ligation to cDNA or DNA fragments at its 3' end 
(adaptor-primer 1) 

In Method II of the invention, another primer used for PGR is an oligonucleotide having a common sequence with 
the oligonucleotide constituting the biotinylated adaptor described above which is subjected to ligation to cDNA or DNA 
fragments at its 3' end (adaptor -primer 2). As a marker which labels this adaptor-primer, those which are used in con- 
ventional analysis may be used. Specific examples include fluorescent dyes, radioactive materials and enzymes. 

These primers may be synthesized by using a commercial nucleic acid synthesizer. 

Potential target diseases which may be analyzed or diagnosed by Method I or Method II of the invention include 
malignant tumors such as brain tumor, stomach cancer, large intestine cancer, breast cancer, uterus cancer, skin can- 
cer, prostate cancer and malignant melanoma; virus infections such as herpes group infections, chronic hepatitis, 
cytomegalovirus infection and acquired immunodeficiency syndrome; and multifactorial hereditary diseases such as 
diabetes and hypertension. 

PREFERRED EMBODIMENTS OF THE INVENTION 

The present invention will be described in more detail below with reference to the following Reference Example and 
Examples, which are provided for the purpose of explanation and should not be construed as limiting the scope of the 
invention. 

[Reference Example] Preparation of qONA 

(1) Purification of RNA by ultracentrifugation 

Mouse livers lyophilized in dry ice or liquid nitrogen were crushed with a homogenizer. To the crushed material. 5 
volumes of a GuCNS solution was added at room temperature and agitated with a vortex mixer. To a 10 ml polyallomer 
tube, 3.5 ml of 5.7 M CsGI/0.1 M EDTA solution was added and 6 ml of the resultant sample was layered over and then 
centrifuged overnight at 15"G at 32000 rpm using Beckman L70 centrifuge. 
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(2) Recovery of the RNA after ultracentrifugation 

The tube was removed from the rotor and all of the supernatant was discarded. The tube wall was wiped and dried. 
Thereafter, the precipitate was dissolved in 300 [il of TE buffer. 
s ■ 

(3) Ethanol precipitation 

To the aqueous layer, 1/10 volume of 3 M potassium acetate (pH 5.0) was added, mixed gently and placed in ice. 
Then, 2.5 volumes of ice-cooled ethanol was added to the above mixture and mixed gently. The resultant mixture was 
10 left at -SO'^C for several hours and centrifuged at 4 for 5 minutes to precipitate RNA. The ethanol was discarded. The 
RNA precipitate was washed with ice-cooled 70% ethanol and re-centrifuged to precipitate RNA. After the ethanol was 
discarded, the RNA precipitate was dried. 

The above precipitate was dissolved in about 100 ,ul of sterile distilled water per 1 g of the tissue cells to obtain an 
RNA solution (RNA concentration := approx. 5 ^ig/ul). 

15 ■ 

(4) Preparation of cDNA template : ^ 

(4-1) Preparation of single-stranded cDNA molecules 

20 First, the resultant RNA and oligo-dT primers only were heated at 70 ''C for 2-3 minutes. Then, other reagents were 

added thereto and kept at 37 "C for 1 hour to synthesize cDNA molecules. 



Composition of the reaction solution 


5x Reverse transcriptase buffer (Gibco-BRL) 


4 fit 


2mM dNTP (Pharmacia) 


4 yi\ 


0.1M DTT 


2 lit 


10 pmol/fil 5'-amino (dT)ig 


1 Ml 


Total RNA (3 MQ) and distilled water 


7.5 Ml 


RNase inhibitor "^^ (40 u/Ml) (Toyobo) 


0.5mI 


200 u/|iil M-MLV Reverse transcriptase"^^ (Gibco-BRL) 


1ld 


V' derived from human placentas 
Molony Murine Leukemia Virus 



40 

(4-2) Synthesis of double-stranded cDNA molecules 

The reaction solution described below was added to the single-stranded cDNA reaction solution and kept at 16 
for 2 hours to thereby prepare double-stranded cDNA molecules. After the completion of the reaction, 3 |ul of 0.25 M 
45 EDTA (pH 7.5) and 2 jiil of 5 M NaCI were added thereto. Then, phenol extraction and ethanol precipitation were con- 
ducted and the precipitate was dissolved in 240 fii of distilled water. 



* Composition of the reaction solution 


1 0 mM MgCl2 


70 Ml 


1 M Tris-CI (pH 7.5) 


IOmI 


1 M (NH4)2S04 


1.5 Mi 


RNase H (Toyobo) (1 u/^l) 


1.5 Ml 


E. coli DNA polymerase 1 (Toyobo) (10 u/^l) 


4.5 Ml 
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[Example 1] Analysis by the DNA molecular indexing method 



(1) Digestion with a class-ilS restriction enzyme (initial digestion) 

The cDNA prepared in Reference Example described above was digested with a restriction enzyme by keeping the 
cDNA in any one of the following reaction solutions (A) to (C) at a specified temperature under specified conditions. 



* Composition of the reaction solution (A) (using Fok 1) 


10xM buffer 


10 ul 


0.1 % BSA (Takara Shuzo) 


10|.il 


cDNA sample 


80 Lil 


Fok 1 (Takara Shuzo) (10 u/^l) 


0.5 111 


Kept at 37 '^C for 50 minutes to 1 hour. 



* Composition of the reaction solution (B) (using Bsm Al) 


10x buffer for Bsm Al (NEB) 


10 .ul 


0.1 %BSA 


10 111 


cDNA sample 


80 Ml 


Bsm Al (NEB) (5 u/ul) 


1 111 


Kept at 55 ''C for 50 minutes to r hour. 



^ Composition of the reaction solution (C) (using Bsm Fl) 


lOxH buffer 


10 ^1 


Distilled water 


10 Ml 


cDN A sample 


80 Ml 


Bsm FI(NEB) (5 u/^i\) 


1 Ml 


Kept at 65 ''C for 50 minutes to 1 hour. 



After the completion of each of the reactions (i). (ii) and (iii) above, 3 mI of 0.25 M EDTA (pH 7.5) and 2 mI of 5 M 
NaCI were added to each reaction solution. Then, phenol extraction and ethanol precipitation were conducted and each 
precipitate was dissolved in 70 ul of distilled water. 

(2) Addition of adaptors 

To the cDNA fragments obtained in (1) above, one of the following adaptors having the sequences described below: 
C1T adaptors: 

5'-8-GTACATATTGTCGTTAGAACGCT-3' 

5'-NXYZAGCGTTCTAACGACAATATGTAC-3' 

or 

C1G adaptors: 

5'-B-GTACATATTGTCGTTAGAACGGG-3' 
5'-NXY2CGGGTTCTAACGACAATATGTAC-3' 
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(wherein B represents biotin; N represents any oi the four bases: and XYZ represents one of the 64 possible 
sequences/When YZ - AA, AT. TA OR TT CiG adaptor were used. Otherwise. C1T adaptors were used.) 
were added and kept in the following reaction solution at 16^0 overnight, to thereby ligate the cONA fragments to the 
adaptors. 

5 



' Composition of the reaction solution 


lOx E. coli DNA ligase buffer 


1 fil 


100mM{NH4)2SO4 


1 ^il 


1 pmol/|ul adaptor solution 


1 Ml 


cDNA sample digested with a class-liS restriction enzyme 


1 Ml 


E. coli DNA iigase 


3 units 


Distilled water to make 1 0 iliI 



(when the sequence XYZ did not contain G nor C. 5 pmol/^i adaptor solution and 30 units of E. coli DNA ligase 
20 were used.) 

(3) Digestion with class-1 IS restriction enzymes (the second digestion) 

The cDNA fragments obtained in (2) above were further digested with ciass-l IS restriction enzymes by keeping the 
25 cDNA sample at a specif ied temperature under the conditions specif ied below: 

(i) When a Fok I digest was used: 

40 m! of distilled water and 5 Lil of 1 0x H buffer were added. 
Bsm Fl (1 unit) was added and kept ateS'^C for 50 minutes. 
30 Bsm Al (1 unit) was added and kept at 55"C for 50 minutes. 

(ii) When a Bsm Al digest was used: 

40 fil of distilled water and 5 )Ltl of 1 0x T buffer were added. 
Fok I (1 unit) was added and kept at 37"C for 50 minutes. 
Bsm Fl (1 unit) was added and kept at 65"C for 50 minutes. 
35 (iii) When a Bsm Fl digest was used; 

40 mI of distilled water and 5 mI of 1 0x M buffer were added. 
Fok I (1 unit) was added and kept at 37"C for 50 minutes. 

Bsm Al (1 unit) and 1 mI of 4 M NaCI were added and kept at 55°C for 50 minutes. 

40 (4) Amplification by PGR 

(4-1) Recovery of the adaptor molecules with paramagnetic beads 

Immediately before use. streptavidin-coated paramagnetic beads were washed twice with 0.1% BSA and once with 
45 1x B&W buffer (10 mM Tris-CI pH 7.5, 1 M NaCl. 1 mM EDTA) and then suspended in an equal volume of 1x B&W 
buffer. 

To each sample. 15 pi of 5 M NaCI and 5 jul of the paramagnetic beads were added, left stationary for 15 minutes 
and washed with lx B&W buffer once. Then, 10 jul of 0.1 M NaOH was added thereto and left stationary for 5 minutes. 
Thereafter, the resultant mixture was washed with 50 |liI of 0.1 M NaOH once, with 1x B&W buffer once and with distilled 
50 water twice. 

(4-2) PGR reaction 

The reaction solutions having the compositions described below were placed in an Eppendorf tube and heated at 
55 96"C for 1 minute to allow a prompt initiation of reactions. Then, a thermal cycle consisting of 30 seconds at 94''G, 1 
minute at 50''C and 1 minute at 72*^0 was repeated 25 to 35 times. After an extension step was carried out at 72'^C for 
20 minutes, the reaction solution was cooled to room temperature. 
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' Compositions of the reaction solutions (per one sample) 
(i) Enzyme reaction solution 


10x PGR buffer for Stoffel fragment 


1MI 


2 mM dNTP 


1 Ml 


25 mM MgCis 


1 .2 mI 


Distilled water 


4.3 mI 


10 u/ul Stoffel fragment '^^ 


0.05 Ml 


(ii) Primer reaction solution 


1 0 pmol/^i f luorescent-Cl T 

10 pmol/jil d(T)25A [or d(T)25C, d(T)25G] 


0.5 Ml 
2 Mi 



' A portion Of AmpliTaq DNA polymerase fragment {Perkin 
Elmer) 



The primers are used in the combinations of JOE-CIT and d(T)25A: FAM-CIT and d(T)25C; and TAMRA-C1T and 
d(T)25G. [JOE: 2',7-dtmethoxy-4',5'-dichloro-6-carboxyfluorescein. RAM: 5'-carboxyfluorescein, TAMRA: 6-carboxy- 
tetramethyl rhodamine (all manufactured by Perkin Elmer; the sequence of C1T: d(GTACATATTGTCGTTAGAAGGCT)]. 

Alternatively, when C1G adaptors were used, the composition of the primer reaction solution is: 



10 pmol/Mi fluorescent-ClG 



10 pmol/Ml d(T)25A [or 6(7)2^0, d(T)25G] 



o:5 

Ml 

2m1 



The primers are used in the combinations of JO E-C1G and d(T)25A; FAM-C1G and d(T)25C: and TAMRA-C1G and 
cl(T)25G. [JOE: 2'.7'-dimethoxy-4'.5'-dichloro-6-carboxyf!uorescein. FAM: 5*-carboxyfluorescein, TAMRA: 6-carboxy- 
tetramethyl rhodamine (all manufactured by Perkin Elmer; the sequence of CTG! d(GTACATATTGTCGTTA- 
GAACGCG)]. 

(4-3) Preparation of Electrophoresis Samples 

From each of the reaction products, a sample was taken as follows: 1 m1 fi'om the combination of FAM-Cl and 
d(T)25C, 3 mI from the combination of JGE-Cl and di(T)2^A and 3 mI from the combination of TAMRA-C1 and d(T)25G. 
To each sample. 5 \a\ of T4 DPase solution having the following composition was added and reacted at 37*^0 for 40 min- 
utes. 



' Composition of T4 DPase solution (per one sample) 


10x M buffer 


0.5 Ml 


2mMdNTP 


0.5 Ml 


Distilled water 


4 mI 


T4 DNA polymerase (Toyobo) 


1 unit 



After ethanol precipitation of the reaction solution. 3.5 mI of a buffer (80% formaldehyde, 10 mM EDTA, 6 mg/ml blue 
dextran) was added to the sample (i.e., precipitate), heated at 95"C for 4 minutes, then immediately applied to the sam- 
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pie well of ABI 373 A electrophoresis apparatus (Perkin Elmer) and run (at a constant electric power of SOW for 13 
hours). 

Fig. 4 shows one example of the electrophoresis patterns obtained. 

[Example 2] Analysis by the DNA molecular indexing method 

(1) Digestion with a class- II restriction enzyme 

The cDNA prepared in Reference Example described above was digested with a restriction enzyme by keeping the 
cDNA in the following reaction solution at a specified temperature under specified conditions. 



' Composition of the reaction solution (using Eco'RI) 


lOxhigh salt buffer (attached to the enzyme) 




cDNA Sample 




EcoRI (Toyobo or Takara Shuzo) 


Sunits 


Kept at 37 "^C for 1 hour. 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the total pre- 
cipitate was used for the subsequent reaction. 

(2) Addition of adaptors 

To the cDNA fragments obtained in (1) above, the following adaptors 
5'-P- AATTCTTAACCAGGCTGAACTTGCTC-3' 

5'-OH-GAGCAAGTTCAGCCTGGTTAAG-3' 
were ligated by keeping the cDNA sample in the following reaction solution at 16"C for 16 hours or more. 



' Composition of the reaction solution 


lOx ligation buffer (similar to Toyobo's) 


2^1 


2.5 pmol/^il EcoRI adaptors 


2^1 


T4 DNA ligase 


150 units 


Total volume 


20 ul 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the total pre- 
cipitate was used for the subsequent reaction. 

(3) Digestion with a class-IIS restriction enzyme 

The cDNA treated in (2) above was further digested with a restriction enzyme by keeping the cDNA sample in the 
following reaction solution at a specified temperature under specified conditions. 
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^ Composition of the reaction solution (using Bsm Al) 


I0x buffer for Bsm AI(NEB) 


10^1 


0.1% BSA 


10 ni 


cDNA sample 


80 Mi 


Bsm Al (NEB) (5u/fil) 


0.5 Ml 


Kept at 65°C for 50 minutes to 1 hour. 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the precipitate 
was dissolved in 30 ^1 of purified water. 

(4) Addition of biotinylated adaptors 

To the cDNA fragments obtained in (3) above, the following adaptors: 

CI T adaptors; 5'-B-GTACATATTGTCGTTAG A ACGCT-3' 

5*-OH-NXYZAGCGTTCTAACGACAATATGTAC-3' 
C1G adaptors: 5'"B-GTACATATTGTCGTTAGAACGCG-3' 

5'-OH-NXYZCGCGTTCTAACGACAATATGTAC-3" 

(wherein B represents biotin; N represents any of the four bases; and XYZ represents one of the 64 possible 
sequences. When YZ = AT or TA, CiG sequences were used. Othenwise, 01 T sequences were used.) 
were ligated by keeping the cDNA sample in the following reaction solution at 16''C overnight. 



* Composition of the reaction solution 


IQnE. coii DNA ligase buffer 


1 Mf 


100 mM (NH4)2S04 


1 Ml 


1 pmolAil adaptor solution 


ImI 


cDNA fragments digested with a dass-ilS restriction enzyme 


1 Ml 


E. coli DNA iigase 


3 units 


Distilied water 


to make 10 mI 



(when the sequence XYZ did not contain G nor C, 5 pmol/jn! adaptor solution and 6 units of E. coli DNA ligase 
were used.) 

(5) Amplification by PGR . 

(5-1) Recovery of the adaptor molecules with paramagnetic beads 

Immediately before use, streptavidin-coated paramagnetic beads were washed twice with 0.1% BSA and once with 
1x B&W buffer (10 mM Tris-Gl pH 7.5. 1 M NaCI. 1 mM EDTA) and then suspended in an equal volume of 1x B&W 
buffer. 

To the sample, 15 mI of 5 M NaCI and 5 of the paramagnetic beads were added, left stationary for 15 minutes and 
washed with 1 x B&W buffer once. Then, 10 jiil of 0.1 M NaOH was added thereto and left stationary for 5 minutes. 
Thereafter, the resultant mixture was washed with 50 iiil of 0.1 M NaOH once, with 1x B&W buffer once and with distilled 
water twice. 
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(5-2) PGR reaction 

The reaction solutions having the compositions described below were placed in an Eppendorf tube and heated at 
96"C for 1 minute to allow a prompt initiation of reactions. Then, a thermal cycle consisting of 30 seconds at 94^C , 1 
5 minute at 50"C and 1 minute at 72"C was repeated 25 to 35 times. After an extension step was carried out at 72"C for 
20 minutes, the reaction solution was cooled to room temperature. 





' Compositions of the reaction solutions (per one sample) 


10 


(i) Enzyme reaction solution 




10x PGR buffer for Stoffel fragment 






2 mM dNTP 


nil 


15 


25 mM MgCl2 


1 .2 ^il 




Distilled water 


4.3 yi\ 




10 u/fil Stoffel fragment "'^ 


0.05 |.il 




(ii) Primer reaction solution 


20 


10 pmol/|iil fluorescent-Cl S primer 


0.5|.il 




10 pmol/iLiI A. gt10 forward primer 


0.5 nl 



A por tion of ArnpliTaq DN A polymerase fragment (Pei kin 
EIniei) 

25 



The two kinds of primers having the following sequences are used in combination: 
5*-OH-GTACATATTGTCGTTAGAAGGG-3'(ClS primer) 
30 5'-OH-GAGGAAGTTCAGGGTGGTTAAG"3*(Xgt10 forward primer) 

(5-3) Preparation of Electrophoresis Samples 

A3 111 sample was taken from the reaction products and 5 mI of T4 DPase solution having the following composition 
35 was added thereto. The resultant mixture was reacted at 37"G for j40 minutes. 



' Gomposition of T4 DPase solution (per one sample) 


lOx M buffer 


0.5|.il 


2 mMdNTP 


0.5 fil 


Distilled water 


4 ul 


T4 DNA polymerase (Toyobo) 


1 unit 



After ethanol precipitation of the reaction solution, 3.5 mI of abuffer (80% formaldehyde, 10 mM EDTA, 6 mg/ml blue 
dextran) was added to the sample (i.e., precipitate), heated at 95°C for 4 minutes, then immediately applied to the sam- 
50 pie well of ABl 373 A electrophoresis apparatus (Perkin Elmer) and run (at a constant electric power of 30 W for 13 
hours). 

Fig. 5 shows one example of the electrophoresis patterns obtained. 
Claims 

55 

1, A method for molecular indexing comprising the following Steps: 

(1 ) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA with a first restriction 
enzyme of class-IIS. 
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(2) ligating each of the resultant cDN A fragments to one from a pool of 64 biotinylated adaptors cohesive to all 
possible overhangs, 

(3) digesting the resultant cDNA fragments further with a second and a third restriction enzymes of class-IIS 
which are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA 

5 sample, 

(4) obtaining a cDNA second sample by repeating the above steps (1) to (3) wherein the second class-IIS 
restriction enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes, are 
used for the subsequent digestion, 

(5) obtaining a third cDNA sample by repeating the above steps (1 ) to (3) wherein the third class-IIS restriction 
10 enzyme is used for the initial digestion and the first and the second class-IIS restriction enzymes are used for 

the subsequent digestion, 

(6) recovering each of the resultant ligation samples by using streptavidin-coated paramagnetic beads and 
then removing from said samples the oligonucleotide complementary to an adaptor-primer to be used in (7), 

(7) amplifying each of the resultant cDNA samples by PGR using an adaptor-primer and one of anchored oligo- 
15 dT primers, 

(8) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes 
of the fragments obtained. 

2. The method of claim 1 , wherein the class-IIS restriction enzymes are used in the combination of Fok f. Bsm Al and 
20 Bsm Fl. 

3. The method of claim 1 . wherein the cDNA has been synthesized by reverse-transcripting a tissue- or cell-derived 
RNA using a mixture of the following oligonucleotides as primers: 

5' OH-GGATCCTi6A-3' 
25 5'0H-CAGCTGTi6C-3' 
5' OH-CTCGAGTi6G-3' 

4. A method for molecular indexing comprising the following steps; 

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA. or DN A with a restric- 
tion enzyme of class-l I, 

(2) ligating each of the resultant cDNA or DNA fragments to an adaptor cohesive to ends generated by the 
class-ll restriction enzyme, 

(3) digesting the resultant cDNA or DNA fragments further with a restriction enzyme of class-l!8, 

(4) ligating each of the resultant cDNA or DNA fragments to one from a pool of 64 biotinylated adaptors cohe- 
sive to all possible overhangs, 

(5) recovering the resultant ligation sample by using streptavidin-coated paramagnetic beads and then remov- 
ing from said sample the oligonucleotides complementary to adaptor-primers to be used in (6). 

(6) amplifying the resultant cDNA or DNA sample by PGR using adaptor-primers, 

(7) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes 
of the fragments obtained. 

5. The method of claim 4. wherein the class-IIS restriction enzyme is any one of Fok I, Bsm Al, Bsm Fi, Sfa Nl or Bbv I. 
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FIG. 2 
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FIG. 3 
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FIG. 4 
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Horizontal axis: Fragment size 
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Primers: (from the upper panel ) FAM-CIT and d(T)rv C; JOE-CiT 

and d{:T)r5 A; TAMRA-CIT and d(T)ziG 
Adaptor: NCCG 

Sample: Mouse liver cDNA (Fok I was used for the initial 
digestion . ) 
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FIG. 5 




Horizontal axis: Fragment size 
Vertical axis: Fluorescence intensity 

Primer: TAMURA CIS 

Adaptor: NAAC for the upper panel, NCAG for the lower pane 
Sample: Mouse liver cDNA (digested with EcoRI and Bsm AI) 
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